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Abstract 

We pick up the regime switching model for asset returns introduced 
by Rogers and Zhang. The calibration involves various markets including 
implied volatility in order to gain additional predictive power. We focus on 
the calculation of risk measures by Fourier methods that have successfully 
been applied to option pricing and analyze the accuracy of the results. 

1 Intro 

This article is based on the innovative approach for modeUing asset returns 
introduced by Chris Rogers and Liang Zhang in their paper "An asset return 
model capturing stylized facts" [T^. The model captures several stylized facts 
of asset returns which is achieved by modelling returns with a hidden two- 
state Markov chain representing states of the economy. Conditional on these 
states returns are modelled by generalized hyperbolic distributions. We shortly 
describe the model and its calibration in Section [21 

We apply the model to equity, bond and commodity markets. Furthermore 
we include the VDAX index, an index that represents implied volatility for the 
future 45 days, in the calibration procedure and hope to increase the predic- 
tive power of the model by this forward looking ingredient. The final aim is 
to calculate value-at-risk and expected shortfall in this framework by Fourier 
techniques from option pricing theory [U |2j |6] in Section |3] 

In the following we recall some stylized facts exhibited by financial time 
series across a wide range of instruments, markets and time periods. 
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Usually the distribution of daily log-returns is leptokurtic which means that 
it shows fat tails and a high peak at its median. Since the normal distribution 
is mesokurtic as its excess-kurtosis is zero, it is not fully adequate for modelling 
such returns. In Figure [l] the fat tails in the return histogram and the bad fit of 
the normal distribution to the data is clearly visible. The returns were calculated 
for daily closing prices of the DAX Index in the period from 18 November 2005 
to 30 December 2009. The second plot shows a QQ-Plot of the data versus 
the theoretical quantiles of the normal distribution and again the deviation is 
visible. The Jarque-Bera statistic confirms the departure from normality at a 
one percent level of significanc^ . 

Density estimation 
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Figure 1: Histogram of daily DAX log returns 



^p-value < 2e-16. 
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Figure 2: QQ-Plot of daily DAX log returns 



Furthermore the autocorrelation of asset returns is usually observed to be 
insignificant while the autocorrelation of absolute returns is significantly differ- 
ent from zero up to surprisingly high lags. This is known as volatility clustering. 
Large movements tend to be followed by large movements, irrelevant of the sign. 
In Figure |3] we see a plot of the autocorrelation function (ACF) of DAX returns 
with no significant autocorrelation whereas the plot of the ACF of absolute 
returns shows high autocorrelations and a slow decay even at high lags. 
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Autocorrelation of simple and absolute log returns of the DAX Index 
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Figure 3: Autocorrelation structure daily DAX log returns 



Finally, there is an asymmetry between large losses and large gains, large, 
sudden draw-downs can be observed. Gains are usually made slowly and in 
a less sharp pattern. As a consequence the distribution is skewed. This fact 
applies to stocks but not necessarily to indices. In Figure [T] we see that the 
distribution of the returns of the DAX index is fairly symmetric. 

Rogers and Zhang develop a model that captures all these facts. The 
main idea is to use a hidden Markov model with a non-Gaussian distribution of 
returns given the states. In the following we shortly describe their model and 
its calibration. 



2 The Hidden Markov Model of Rogers and Zhang 
2.1 General Properties 

Hidden Markov models, in finance sometimes applied as regime switching mod- 
els, are mixture models for financial time series where the distribution of the 
series is determined by an unobservable Markov chain. Whereas the chain is 
unobservable, the returns are observed and conditional statements can be done. 

In this model we consider an ergodic two-state first order Markov chain 
(Cn)neN • Let P — {pij} e denote the transition matrix. The /c-step transi- 
tion probabilities are denoted by p^^ = P [Cn = j\£.n-k = i] and P*^*^) — {Pij} '■— 

denotes the corresponding fc-step transition matrix. 

For modelling the returns, two sequences (X^)^^j^ for i <E {1,2} independent 
of (^„)„gN are defined. In each state i E {1,2} we have X'^^ ^ Fi for n e N. 
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The concrete choice of the distributions Fi, i E {1,2}, will be revealed further 
below. 

Based on these definitions the returns are modelled by 

2 

rr, = Y,Mi,.=^}K, forneN. (1) 

As (^n)neN is crgodic we can determine the stationary distribution tt — (tti, 7:2)' ■ 
The stationary distribution is given by the normalized left eigenvector of P, so 
the calculation of tt for the two-state model is easily done and we get 

( 1-P22 \ 
2-P11-P22 / 

The model incorporates the stylized facts discussed above. To show this we 
cite the formula for the covariance structure of (Ij'nDngN (^^^ |13|). 

Proposition 2.1. Let g be a function of the return, such thatK [g{rn)\S,n — Si], 
the expected value given the state, exists for i G {1,2}. Then the autocovariance 
of 9 {rn)neN 9^'"en by 

COV[.g(r„),g(r„+fc)]=7rGP(''^)Gl-(7rGl)% fc = l,2,... (2) 

where G := diag{Gi,G2) and Gi := E [(7(r„)|^„ = Si] , i G {1,2}. 

One stylized fact mentioned is that log returns do not exhibit significant 
autocorrelation. The following proposition of Rogers and Zhang |12| assures 
that the model captures this property. 

Proposition 2.2. Let fi^ E [r„|^„ = i] for i e {1, 2}. If fi^ ^ ^ for i e {1, 2} 

then 



E [r„r„+fc] = /Lt^ (3) 

for any k > and n G Z. 

It is worth noting that the proof of this statement does not require that 
the Markov chain has only two states but it is true for any finite number of 
states as long as the expected returns are equal. Rogers and Zhang L12j find 
that a two state Markov chain suffices to reproduce the stylized facts mentioned. 
Furthermore the economic interpretation as riskier and less risky times is clear 



in this set-up. Our findings in the calibration example in Subsection 2.2 support 
this view. 

So far we have assumed that X'!^ ^ Fi V n and i G {1,2}, where Fj is 
a general distribution function. In [H] the class of Generalized Hyperbolic 
(GHYP) distributions is chosen. There are several alternative parametrizations 
for the GHYP family, each with different parameter ranges. The following 
parametrization is the most widely used in literature. 

Definition 2.1. The univariate density of a GHYP {a, ji. A, 5, /i) is given by 



where Kx is a modified Bessel function of the second kind. 
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The generalized hyperbolic distribution is a flexible class as it covers a wide 
range of well-known distributions as limiting or subclasses. One can obtain the 
normal distribution as a limiting case for (5 — > oo and bja — > . For A = 1 
these are the hyperbolic distributions, whereas for A = —1/2 we get the normal 
inverse Gaussian distribution. Another important limiting case is the Student-t 
distribution, which is a limit for A < and a = /? = = 0. Furthermore, using 
the GHYP distribution we can model the observed heavy tails. In particular 



GHYP 



|A-1 ^ly±a+0)x 



for 



±CX3. 



Recall the moment generating function in the chosen parametrization. 

Definition 2.2. The moment generating function (MGF) of a generalized hy- 
perbolic distribution is 



MGF{t) 



/?2 



a2 - (/? + ty 



A/2 



for \l3 + t\ < a. 



In Subsection |3 . 1 1 we will apply Fourier techniques to calculate the VaR and 
the ES. It can be shown that the MGF is real analytic, therefore MGF is a 
holomorphic function for complex z with \z\ < a — 13. So the characteristic 
function can be obtained through (t>{t) — MGF(i<). 

In the plot in Figure |4] we illustrate the application of GHYP in the case of 
log returns of the DAX index. In the histogram a fitted generalized hyperbolic 
distribution is superimposed together with a fitted normal distribution. The 
superior fit of GHYP in the center as well as in the tails is plain to see. 



Density estimations 




- GHYP 
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Figure 4: Histogram of daily DAX log returns 
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2.2 Calibration example 

Given the fact that the model captures all the stylized facts mentioned, it is re- 
markable that it still can be calibrated using maximum likelihood. Additionally 
to the likelihood function a term that penalizes a deviation of the theoretical au- 
tocovariance function of absolute returns given by (|2| from the observed values 
is incorporated (for details we refer to [H]). We have implemented the model 
and its calibration in R [TD] using the functions for the GHYP distribution 
provided in [7]. 

We calibrated the model to four indices. Note that as in [T^ the Markov 
chain is one and the same for all indices. This will help us to derive an economic 
interpretation of the states. Except for the commodities index we focused on 
European indices in order to avoid closing time differences (see for example [11] ) . 
We considered the following indices: 

Stocks For stocks the DAX index, the German stock index was chosen. The 
DAX index comprises of 30 major German companies and is one of the 
most important stock indices in Europe. 

Bonds As a representation for bonds as less risky investment opportunity the 
REX index, the German Rentenindex, was selected. The REX measures 
the performance of 30 German government bonds. Note that the REX is 
not investable but it still serves as a good proxy for the bond market. 

Commodities The Dow Jones UBS Commodity Index (DJUBS) was consid- 
ered. The Index is a futures based index which includes the following 
commodities markets: energy, precious metals, agricultural goods, indus- 
trial metals, and live stock. 

Volatility The VDAX the German volatility index was also included as it tries 
to capture implied volatility of the DAX in a forward looking way. The 
VIX, the analogous to the VDAX is known as a 'fear' index. Although the 
returns of a volatility index are of a quite different nature than stock index 
returns (compare estimated densities in of DAX returns resp. VDAX in 
Figure [t]) we included the VDAX in order to get more predictive power 
for the risk measures. 

The time series consists of 1609 daily returns from 18 November 2005 to 27 
June 2012 This data set was divided into two parts one from 18 November 
2005 to 30 December 2009 as calibration period for the model and from 04 
January 2011 to 27 June 2012 as out-of-sample period used for backtesting. 
Figure |5] shows the movement of the four indices for the whole time period 
from 18 November 2005 to 27 June 2012. We can clearly see the spike in 
the VDAX beginning in late 2008 and the corresponding decline in the DAX 
and the DJUBS. In the last year the Fukushima crisis is visible as a slight 
downward movement and of course the European sovereign-debt crisis with the 
stock market crash beginning in August 2011. 

^The data was obtained via yahoo finance. 
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Time Series Indices 
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Figure 5: Market performance 2005 - 2012 

Remark 2.1. To estimate tlie parameters in this model with two states, one 
needs a time series long enough as there are 38 parameters to estimate, namely 
ten parameters for a single asset plus two transition probabilities. As we require 
the same means in both states there are four parameters less to estimate in total. 

The next figures show the results of the maximum likelihood estimation. In 
Figure [6] the observed autocovariance structure versus the autocovariance in the 
estimated model is plotted. We see that we get a fairly good fit. Figure [7] shows 
the estimated density functions for the considered indices. The densities are 
quite symmetric (except for the VDAX), as expected. Furthermore all fitted 
distributions exhibit fat tails and the distribution in one state is clearly riskier 
(more leptokurtic) than in the other state. Therefore we can conclude that 
the model has an economic meaning in the sense that one state represents an 
optimistic and the other state a pessimistic market. Another property is that 
if we are in the optimistic state for the stock index we are in the pessimistic 
state for bonds and vice versa. Which is again our natural intuition, that in 
optimistic upward moving market we tend to invest in stocks. 

The optimistic state for the bond index is the same as the optimistic state 
for the commodity index which is surprising. If we think of simple correlation 
the commodity index should go together with the stock index. Maybe this fact 
is due to the broadness of the commodity index. 
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ACF theor. vs. estimated DAX 
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Figure 6: ACF theoretical vs. observed DAX index 
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Figure 7: Density function for both states for the four indices 
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Having calibrated the model to the data we turn to applications. Rogers and 
Zhang present a number of applications fl2] whereas we focus on an application 
to risk measures. 



3 Risk measures 

The first object that we want to address (see also [T^]) is the posterior probability 
of being in a certain state. This means that having observed a certain return it 
tells us how our view of the current state changes. 

Definition 3.1. The posterior probability p„ = (p^jp^J-^ at time n can be 
calculated by 



Pn^iPF{rn,ei,02) 
Pr,-iPF{rn, 61,62)1' 



(5) 



where 



Fir;6,,62) = dtag ]]_ f{r' ;6i),l[ f{r^ ;6i) , 

m is the number of markets, / the GHYP density and 6-f with i G {1,2} and 
j e 1, . . . , TO denote the parameters of the given market in the respective state. 

The next plot shows the posterior probability of being in the optimistic state 
for the DAX index together with the performance of the index. 



Time Series DAX 



S 5000 

"S 1.00 



























































































1 










t 


i] 








































111 








201c 


-01 201 


-07 201 


-01 201 

Date 


-07 20 


1 

2-01 201 


-07 



Figure 8: Posterior probability - DAX index 
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We see that the posterior probability reacts relatively quickly to losses in the 
index. In the downturn beginning in August 2011 the model captures this tense 
market state and swings into the pessimistic state. The posterior probability 
takes values close to zero or close to one most of the time telling us quite 
clearly in which state we are. Observing this object could be of great value in 
risk management. Recall that the calibration period ended in 2009 and that 
thus the data shown is purely out-of-sample. Despite this fact the posterior 
probability shows a satisfying pattern even in 2011 which is already more than 
one year after the calibration period. 

Additionally, the posterior probability measure is an input needed for the 
calculation of risk measures as we will see in the following subsection. 

3.1 Value-at-Risk 

In order to derive the value-at-risk (VaR) by Fourier methods we cite the char- 
acteristic function from jT^ . 

Definition 3.2. The characteristic function of the N-period return distribution 
can be calculated by 

Mt)^P.[p[^f ^))"l (6) 

where p„ is the posterior distribution at time n, given in (|5|, P the transition 
matrix and 0^ i € {1,2}, are the univariate one-period characteristic functions 
of the asset return for the two states. 

Note that by this definition the market returns of all assets that are modelled 
together influence the posterior view in p„ and thereby the distribution of the 
specific asset considered. For example in the model that we calibrated, an 
increase in implied volatility as measured by VDAX will change this view and 
therefore change the posterior marginal distribution of the DAX. 

For the purpose of backtesting the VaR below, we will stick to the case 
= 1. But the general form ^ is needed for holding periods of more than one 
day. 

Definition 3.3. The value-at-risk is defined by 

VaRa = inf {x e M : F{x) > a} , 

where a is the confidence level and F the cumulative distribution function 
of the asset return. If F is continuous and strictly increasing the VaR is the 
unique solution of F{VaRa) = a. 

The general approach to the calculation of risk measures that we apply to 
this model uses Fourier techniques from option pricing theory (see e.g. Lewis' 
method |2 Section 11.1.3] or [6J). To this aim we extend the definition of 
the characteristic function (/) to a function of a complex argument. Due to [?, 
Theorem 7.1.1] this complex Fourier transform is well defined on a strip of 
regularity in the complex plane, denoted by Sx- Let / be the density function 
and (f> the corresponding characteristic function. Then by an application of the 
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Plancherel (sometimes called Plancherel-Parseval) theorem for a function g{x) 
it holds that 



1 



f{x)g{x)dx = — / (f){u)g{u)du, (7) 



where g is the generalized Fourier transform of g given by 

g{u) = / e-™^g(z)dz, 



and (5 = for z e Sx is chosen such that the integral on the right hand side 
converges. 

The final aim is to represented the risk measure of interest such that it can 
be efficiently calculated by the Fast Fourier Transform (FFT). For details of an 
implementation of FFT see e.g. [T]. 

Proposition 3.1. Let cj) be the characteristic function of the distribution func- 
tion F and S > 0, then F can be computed by the following formula 

F(y) ^ f r (8, 

27r 5~iv 
Proof. We can apply the general approach and find 

oo 



g{u) = / e ™'^l{^<y}da; 



OO 

ip-iuy 



— lU x=-ao U 

for ~ 6 > such that the above integral converges. Applying Q and a 

variable change we get 

27r J-oo+iS u 
2n 6 -IV 

□ 

The structure of ([s]) allows us to compute the distribution function efficiently 
by FFT. Finally a simple root search delivers the value-at-risk. 

We apply this approach to the DAX index which is calibrated on data from 
November 2005 to December 2009 as described above and analyze the in-sample 
fit first. Figure [9] shows the index returns together with the VaRgc,% estimate 
resulting from dsF. The dashed horizontal lines are drawn at the VaRgij% in the 
two states respectively. Changes in the VaR estimate in the area between the 
dashed lines in Figure [9] are due to changes in the posterior distribution (|5| , 
which is plotted below, reflecting the combined development of the market in- 
dices that are used in its calculation. 

We backtest the VaR model using the binomial test [5 . Observing 29 vi- 
olations with a sample of 999 days gives a p-value of 0.999 and thus strongly 
supports the accuracy of the model. 



12 



To test the null hypothesis of random occurrences of VaR-violations we use 
the runs test as implemented in the R package tseries [?] and find a p-value 
of 0.44 which allows us to accept this null hypothesis. This confirms that the 
posterior distribution is reasonable sensitive to changes in the riskiness of the 
asset return. 



DAX returns with 95% VaR 
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Figure 9: 95% value-at-risk for the DAX in-sample 



As a second risk measure we treat the calculation of expected shortfall in 
this model. Note that the distribution of asset returns is continuous. Thus we 
can state the following definition: 

Definition 3.4. The expected shortfall (ES) is defined by 

ESc{X) = E [X\X < VaRa,{X)] , 

where a is the confidence level and VaR^ is the value-at-risk to the level a as 
defined in Definition 13.31 



Proposition 3.2. Let <j) he the characteristic Junction of the distribution func- 
tion F and 6 > 0, then the distribution function can be computed by the following 
formula 

r^s 1 f°° -^vY^.R " w) VaR„ - 1 ^ , 

ES^iX) = J J ^iv + ^S)dv. (9) 

Proof. First we calculate 

i-y 

xf{x)dx, 

oo 
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by the general approach ^ above and then plug in the value-at-risk for y and 
apply the FFT. Applying integration by parts we get the transform 



oo 

iux -1 



g{u) = I xe ™^l{^<j^}dx 

^—iux 
: — dx 



e 

X- 



oo 
iux y 



~iuy Q-iuy 



(—iu)^ {—iu)'' 



{-iuy- 1), 



for Q{u) = S > such that the above integral converges. Applying Q and a 
variable change we get 

^ /■oo+iS ^—iuy \v—u—iS\ 

= ^LJ 

The result follows by plugging in the VaR for y and using the definition of 
expected shortfall. □ 

The form of ^ allows for an application of the FFT. Reading off the value 
at the value-at-risk from the transformed sequence gives the expected shortfall. 



Figure 10 shows the index returns together with the ESq^% estimate resulting 
from Q in the out-of-sample period. The dashed horizontal lines are drawn at 
the ESq^% in the two states respectively. Again we see the estimations of ESg^% 
wander up and down between these two extremes as the posterior probability 
changes. The ESg^% is breached only once in the out-of-sample period. 

Concerning VaRgQ% in the out-of-sample period we observe 10 breaches dur- 
ing 610 days which gives a p-value of 0.99 in the binomial test. A runs test for 
the mixing of the breaches gives a p-value 0.6772. These test results allow us 
to accept the model although the small number of breaches could indicate that 
the model calibrated in the highly volatile years around 2008 is slightly too con- 
servative for the out-of-sample period. In an industrial application the model 
should be recalibrated on a regular basis, e.g. once a month. Note that due 
to the high number of parameters the model is calibrated on 3 years of data. 
Thus more frequent calibration will not change the model significantly as the 
influence of the data added is rather small. 



In Figure [TT] we compare the value-at-risk derived from the regime switching 
model to the value-at-risk from a much simpler approach in order to see how 
much we gain by introducing the two states. As a comparative model we simply 
assume a GHYP distribution of the returns. Doing so we loose some of the 
features mentioned above but we still can model heavy tails and skewness. While 
we have used a history of approximately 3 years for calibrating the regime 
switching model we need much less data for the simple model and use this 
advantage. Namely, in each time step we look back for 250 business days and fit 
the distribution and calculate the VaR. This VaR is then compared to the return 
of the following day. Figure [TT] shows that this simple model performs much 
worse. It reacts too slowly to changing market conditions and accumulates 36 
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breaches in the testing period whereas the VaR of the regime switching model is 
breached only 10 times. This illustrates the benefit of the modelling approach. 



PAX returns with 95% ES 
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Figure 10: 95% expected shortfall with posterior probability 
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Figure 11: 95% VaR of the regime switching model compared to the simple 
model 



4 Conclusion 

In this article the model introduced by Rogers and Zhang [T^ is applied to 
calculation of the widely used risk measures value-at-risk and expected shortfall. 
In order to find a proper description of market conditions we calibrate the model 
to stocks, bonds, commodities and volatility. The volatility index is included in 
the calibration in order to increase the predictive power of the resulting model. 
Fourier methods from option pricing are applied to derive formulas that allow 
for an application of FFT to calculate the risk measures. Finally in-sample and 
out-of-sample analysis show the benefit of the method proposed. 

Although the calibration boils down to a maximum-likelihood estimation a 
proper choice of the length of the calibration period turns out to be crucial 
which can be seen as a weak point of the method. 
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